(54) Method of automatically selecting oligonucleotide hybridization probes



(57) A method is provided for automatically selecting oligonucleotide hybridization probes for detecting a mutation causing a mismatch in a DNA duplex. The method comprises the steps of: defining the mutation to be detected by a DNA sequence of a mutant DNA strand and a DNA sequence of a wild-type DNA strand corresponding to the mutant DNA strand; defining an integer variable L1 for a first length of the oligonucleotide hybridization probes from the place of mutation in a first direction; defining a range from m1 to n1 of possible values of the variable L1; defining an integer variable L2 for a second length of the oligonucleotide hybridization probes from the place of mutation in a second direction opposite to the first direction; defining a range from m2 to n2 of possible values of the variable L2; selecting one or more kind(s) of the oligonucleotide hybridization probes from probes which have a one-to-one hybridization fit to the mutant DNA strand, probes which have a one-to-one hybridization fit to the wild-type DNA strand corresponding to the mutant DNA strand, probes which have a one-to-one hybridization fit to an opposite sense mutant DNA strand which is the complementary DNA strand to the mutant DNA strand, and probes which have a one-to-one hybridization fit to an opposite sense wild-type DNA strand which is the complementary DNA strand to the wild-type DNA strand corresponding to the mutant DNA strand; determining all possible hybridization probes from the possible values of the variables L1 and L2 and the kind(s) of probes; defining a thermodynamic nearest-neighbour model for calculating a melting point of a certain hybridization probe hybridized to a DNA strand, the melting point being a temperature at which a predetermined percentage of a multitude of identical pairs of said certain hybridization probe and said DNA strand is in an annealed state; calculating, on the basis of the thermodynamic nearest-neighbour model, for all possible hybridization probes a first melting point of the respective probe hybridized with its complementary mutant DNA strand or opposite sense mutant DNA strand, respectively, a second melting point of the respective probe hybridized with its complementary wild-type DNA strand or opposite sense wild-type DNA strand, respectively, and a temperature difference between the second melting point and the first melting point; and ranking the possible hybridization probes with regard to the temperature difference.
Description

[0001] The invention relates to a method of automatically selecting oligonucleotide hybridization probes for detecting 
a mutation causing a mismatch in a DNA duplex. 

[0002] The hybridization of oligonucleotide hybridization probes is a technique with widespread application in molecular biology. It has been used for the detection of immobilized nucleic acids in northern or southern blotting and for allele specific oligonucleotide hybridization. Provided stringency conditions are met, only a perfectly complementary hybridization probe will basepair with the DNA strand observed. The method can therefore be used for mutation detection when the presence of a mismatch caused by a base exchange disrupts the Watson-Crick pairing and destabilizes the partial duplex formed by the hybridization probe and the DNA strand observed.

[0003] A known tool for monitoring the hybridization of a labeled hybridization probe to a DNA strand to be observed in a homogeneous assay is the LightCycler (Trademark, manufactured by Roche Molecular Biochemicals). Three different wavelengths are available for the optical detection of oligonucleotide hybridization when an appropriate fluorescence resonance energy transfer (FRET) pair is formed by adjacent hybridization of two dye labeled hybridization probes with the DNA strand to be observed. One of the two hybridization probes is called the anchor probe, while the other probe, which is directly sensitive to the mutation to be detected, is called the detection probe. The monitored fluorescence signal is proportional to the amount of hybridized hybridization probe pairs. The temperature at which a certain percentage (normally 50%) of the hybridization probe pairs is hybridized, i.e. in a so-called annealed state, is called the melting point Tm of the hybridization probe pair. This melting point can also be regarded as the melting point of the detection probe alone, as the melting point of the anchor probe is typically set to be considerably higher than that of the detection probe.

[0004] For mutation detection the Tm shift caused by the mutation should be as high as possible to ensure good 
discrimination of heterozygotes. 

[0005] General guidelines for the construction of hybridization probes for use in both quantification and mutation detection have been given [1], but these depend on an accurate estimation of DNA melting temperatures.

[0006] The melting temperature of short (<20 bp) oligonucleotides is often estimated with the Wallace/Itakura rule Tm = 2°C x (number of A-T bp) + 4°C x (number of G-C bp). Although this approximation assumes a salt concentration of 0.9 M NaCl, typical for dot blots and other hybridizations, it also works well for PCR applications, which are not very sensitive to differences in Tm. However, for hybridization probes a more accurate estimation of Tm is required.
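The rule of thumb above can be illustrated with a minimal Python sketch. The function name `wallace_tm` is hypothetical and the sequence is only an illustrative input; the sketch implements nothing beyond the 2°C/4°C counting rule stated in the text.

```python
def wallace_tm(seq: str) -> int:
    """Rule-of-thumb Tm in deg C: 2 per A-T base pair, 4 per G-C base pair."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Illustrative 18mer with 8 A/T and 10 G/C bases: 2*8 + 4*10 = 56
print(wallace_tm("ACGATGGCAGTAGCATGC"))
```

The rule ignores sequence order, salt and probe concentration, which is why it is not sufficient for selecting hybridization probes.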
[0007] From US 3 556 749 it is known to automatically select oligonucleotide hybridization probes taking their respective melting points into consideration. However, this known method has proved not to be sufficiently accurate.
[0008] Therefore, it is an object of this invention to provide a method of automatically selecting oligonucleotide hybridization probes for detecting a mutation causing a mismatch in a DNA duplex by a sufficiently accurate estimation of their Tm.

[0009] This object is achieved by a method according to claim 1.

[0010] Advantageous embodiments of this method are outlined in the subclaims 2 to 9. 

[0011] The nearest-neighbor (n-n) model is based on thermodynamic calculations and gives the most precise prediction of oligonucleotide stability. This model assumes that the thermodynamic parameters for a given base pair depend only on the identity of its adjacent pair and that these nearest neighbor parameters are pairwise additive. The stability of a given oligonucleotide sequence is a function of two basic sources: the sum of the interstrand H bonding between Watson-Crick paired bases and the intrastrand base stacking. The application of this model to the calculation of DNA duplex stability is detailed below. The derivation of the formulas used can be found in the literature [2] and will not be given here. Several data sets are now available that describe the ten n-n pairs that occur in double stranded oligonucleotide DNA [3-6]. The contribution of a mismatch to the duplex stability depends on its location and orientation and on the neighboring bases as well [3]. The destabilizing effect of the 48 possible single mismatches can also be taken into account if the n-n data of the respective mismatches are used for the calculation of the melting temperature [3;7-10]. Oligonucleotides with repetitive sequences or strings of A-T base pairs may deviate from the n-n model, as may molecules which do not melt in a two state (all or nothing) manner [11]. Hybridization conditions (ionic strength and probe concentration) also influence the Tm and must be considered. It could be shown that the n-n model is able to predict the melting points observed with hybridization probe assays with a standard error of less than 1°C. The n-n model is most beneficial for the thermodynamic predictions of oligonucleotide DNA. In longer DNA strands, interactions that are independent of the neighboring bases become increasingly important. This reduces the utility of the n-n model for the prediction of longer (>150 bp) DNA duplexes.

[0012] In the following, the invention will be described in more detail with regard to preferred embodiments of the new method. Further, a brief description of how thermodynamic data for use in the new method can be calculated will be given. Next, the application of the n-n model to hybridization probe assays on the LightCycler will be validated. Finally, a general comment will be given on the sensitivity and specificity of hybridization probes selected by the new method.
[0013] In the Figures: 
Figure 1 is a flow diagram of a first embodiment of the new method;

Figure 2 is a flow diagram of a second embodiment of the new method;

Figure 3 is a flow diagram of a third embodiment of the new method;

Figure 4 shows an example DNA duplex of an 18mer oligonucleotide; there are 17 n-n doublets, and initiation energies for the terminal G-C and A-T pairs must be considered; the oligonucleotide is non-self-complementary;

Figure 5 is a diagram of experimentally observed melting points versus melting points calculated with the n-n model; the regression was calculated according to Passing and Bablok [33], while the standard error of the estimate (Sy|x) was calculated with the principal component method; and

Figure 6 is a contour plot showing the interdependence of the melting temperature shift of 19mer probes destabilized by a single mutation versus their GC content; the different frequencies are shown in shadings.

[0014] As indicated in Figure 1, a method of automatically selecting oligonucleotide hybridization probes for detecting a mutation causing a mismatch in a DNA duplex starts with step 1 of defining the mutation to be detected by a DNA sequence of a mutant DNA strand and a DNA sequence of a wild-type DNA strand corresponding to the mutant DNA strand. Here, it is unimportant whether the mutant DNA strand and the wild-type DNA strand are sense or antisense strands, as the sense strand always defines the antisense strand and vice versa. In the following as well as in the claims, the term opposite sense strand is used to refer to the strand having the sense opposite to the strand defined in step 1. Therefore, opposite sense strand is not another formulation for antisense strand but may also refer to a sense strand.
[0015] In step 2 of the method, integer variables L1 and L2 are defined for a first and a second length of the oligonucleotide hybridization probes from the place of mutation in a first direction and in a second direction opposite to the first direction, respectively. Further, ranges from m1 to n1 and from m2 to n2 of possible values of the variables L1 and L2, respectively, are defined.
[0016] Alternatively, step 2 can include defining a first variable for an end point of the oligonucleotide hybridization probes located from the place of mutation in a first direction, and defining a second variable for a length of the hybridization probes in a second direction opposite to the first direction. Then, first and second ranges are to be defined for the first and the second variables, in which the second range depends on the first range, as the hybridization probes must in any case cover the place of mutation.

[0017] In the end, both alternatives of step 2 lead to the same results. Therefore, they are regarded as identical in the following as well as in the claims.

[0018] In step 3 one or more kind(s) of the oligonucleotide hybridization probes are selected from

probes which have a one-to-one hybridization fit to the mutant DNA strand,

probes which have a one-to-one hybridization fit to the wild-type DNA strand corresponding to the mutant DNA strand,

probes which have a one-to-one hybridization fit to an opposite sense mutant DNA strand which is the complementary DNA strand to the mutant DNA strand, and

probes which have a one-to-one hybridization fit to an opposite sense wild-type DNA strand which is the complementary DNA strand to the wild-type DNA strand corresponding to the mutant DNA strand.

All of the above kinds can be selected if no special requirements are given. If such requirements exist, particular kinds of the oligonucleotide hybridization probes can be selected or deselected.

[0019] In step 4 all possible hybridization probes are determined from the input data of steps 2 and 3, i.e. from the 
possible values of the variables L1 and L2, and the kind(s) of probes. 
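Steps 2 to 4 can be sketched in Python as follows. This is only an illustration under stated assumptions: the names `candidate_probes` and `revcomp` and the four kind labels are hypothetical, the mutation is a single-base exchange at 0-based index `pos`, L1 and L2 count the bases on either side of the mutated base (so a probe has L1 + 1 + L2 bases), and both input strands are written 5'->3' and have equal length.

```python
COMP = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement, read 5'->3'."""
    return seq.translate(COMP)[::-1]


def candidate_probes(mutant, wild, pos, l1_range, l2_range, kinds):
    """List (kind, L1, L2, probe) for every allowed L1/L2 pair and kind.

    A probe that fits a strand is the reverse complement of the covered
    window; a probe that fits the opposite sense strand equals the window.
    """
    probes = []
    for l1 in range(l1_range[0], l1_range[1] + 1):
        for l2 in range(l2_range[0], l2_range[1] + 1):
            start, end = pos - l1, pos + l2 + 1
            if start < 0 or end > len(mutant):
                continue  # window would run off the defined sequence
            for kind in kinds:
                strand = mutant if kind.startswith("mutant") else wild
                window = strand[start:end]
                probe = window if kind.endswith("opposite") else revcomp(window)
                probes.append((kind, l1, l2, probe))
    return probes


KINDS = ("mutant", "wild", "mutant_opposite", "wild_opposite")
wild = "ACGATGGCAGTAGCATGC"
mutant = "ACAATGGCAGTAGCATGC"  # illustrative G->A exchange at index 2
probes = candidate_probes(mutant, wild, 2, (2, 2), (3, 4), KINDS)
print(len(probes))
```

Every window contains the mutated position by construction, which corresponds to the requirement that the probes cover the place of mutation.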

[0020] In step 5 a thermodynamic nearest-neighbor (n-n) model for calculating a melting point of a certain hybridization probe hybridized to a DNA strand is defined. That n-n model will be discussed in detail below. The melting point is a temperature at which a predetermined percentage of a multitude of identical pairs of said certain hybridization probe and said DNA strand is in an annealed state, i.e. hybridized.

[0021] In step 6, on the basis of the thermodynamic nearest-neighbor model, the following values are calculated for all possible hybridization probes:

a first melting point of the respective probe hybridized with its complementary mutant DNA strand or opposite sense mutant DNA strand, respectively,

a second melting point of the respective probe hybridized with its complementary wild-type DNA strand or opposite sense wild-type DNA strand, respectively, and

a temperature difference between the second melting point and the first melting point.

[0022] Finally, in step 7, the possible hybridization probes are ranked with regard to the temperature difference, and a ranked list of the hybridization probes is output.
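The ranking of step 7 can be sketched in Python as follows. The record type `Candidate`, the function `rank_by_shift` and the melting points in the usage lines are hypothetical and purely illustrative; ranking by the absolute value of the difference, largest first, is an assumption based on the stated aim of a Tm shift that is as high as possible.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    probe: str
    tm_mutant: float  # first melting point, deg C
    tm_wild: float    # second melting point, deg C


def delta_tm(c: Candidate) -> float:
    """Temperature difference between second and first melting point."""
    return c.tm_wild - c.tm_mutant


def rank_by_shift(cands):
    """Sort candidates so that the largest absolute Tm shift comes first."""
    return sorted(cands, key=lambda c: abs(delta_tm(c)), reverse=True)


cands = [
    Candidate("P1", 56.2, 62.7),
    Candidate("P2", 58.0, 60.0),
    Candidate("P3", 50.0, 59.0),
]
for c in rank_by_shift(cands):
    print(c.probe, round(delta_tm(c), 1))
```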

[0023] According to Figure 2, a preferred embodiment of the new method further includes step 8, in which a temperature range of a hybridization assay to be employed for detecting the mutation using the oligonucleotide hybridization probes is defined. Then, in step 9 following step 6 and preceding step 7, it is checked for each of the possible hybridization probes whether both the first and the second melting points are within the temperature range of the hybridization assay. Here, in step 7 only those possible hybridization probes are ranked which have both their melting points within the temperature range of the hybridization assay.

[0024] As the maximum Tm shift caused by the mutation is limited, it is also sufficient to define a narrower temperature range in step 8 and to check only whether the melting point corresponding to that narrower temperature range lies within it.

[0025] The actual temperature range depends on the hybridization assay to be employed. However, it will normally be defined within a range from 20 to 80 °C.
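The check of step 9 can be sketched in Python as follows. The function names and the candidate values are hypothetical; the default limits of 20 and 80 °C are taken from the range mentioned above and would be replaced by the range of the actual assay.

```python
def within_window(tm_first, tm_second, t_min=20.0, t_max=80.0):
    """True if both melting points lie inside the assay temperature range."""
    return t_min <= tm_first <= t_max and t_min <= tm_second <= t_max


def filter_by_window(cands, t_min=20.0, t_max=80.0):
    """Keep only (probe, tm_first, tm_second) tuples that pass the check."""
    return [c for c in cands if within_window(c[1], c[2], t_min, t_max)]


cands = [("P1", 56.2, 62.7), ("P2", 18.5, 25.0), ("P3", 75.0, 82.3)]
print(filter_by_window(cands))
```

Only the probes that pass this filter would then be handed to the ranking of step 7.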

[0026] According to Figure 2, a further step 10 is added to the method according to Figure 1. In step 10 at least one physico-chemical parameter of the hybridization assay to be employed for detecting the mutation using the oligonucleotide hybridization probes is additionally defined. Correspondingly, step 5 includes defining the thermodynamic nearest-neighbor model in dependence on the at least one physico-chemical parameter of the hybridization assay. The at least one physico-chemical parameter of the hybridization assay can, for example, be an ion concentration under which the hybridization of the hybridization probes is to be monitored. Such an ion concentration can be a total cation concentration expressed as a Na+ cation equivalent concentration.

[0027] According to Figure 3, a further step 11 is added to the method according to Figure 2. In step 11 an integer number N of the hybridization probes to be automatically selected is defined. Correspondingly, in step 7 only a list of the N top ranked possible hybridization probes is output. Alternatively, in step 7 the possible hybridization probes can be ranked separately for each of the kinds of the oligonucleotide hybridization probes, and the N top ranked possible hybridization probes can be output for each of the kinds of the oligonucleotide hybridization probes.
[0028] Further, step 7 can include checking, for each of the possible hybridization probes, whether it is able to hybridize either to the mutant DNA strand and the opposite sense mutant DNA strand, or to the wild-type DNA strand and the opposite sense wild-type DNA strand, in more than one way. Then, ranking the possible hybridization probes with regard to the temperature difference may include ranking only those possible hybridization probes which are able to hybridize in more than one way neither to the mutant DNA strand and the opposite sense mutant DNA strand nor to the wild-type DNA strand and the opposite sense wild-type DNA strand.

[0029] In an optional step 12, one or more further mutations which may occur on the mutant DNA strand or the wild-type DNA strand or the opposite sense wild-type DNA strand may be defined, so that in step 7 it may be checked, for each of the possible hybridization probes, whether it is sensitive to the further mutation in the same way as to the mutation. Thus, ranking the possible hybridization probes with regard to the temperature difference may be limited to ranking only those possible hybridization probes which are not sensitive to the further mutation in the same way as to the mutation.

[0030] Further refinements due to special requirements for the possible hybridization probes to be selected are possible and easily accomplished by those skilled in the art.

[0031] In the following, the theoretical background of the n-n model to be defined in step 5 of the methods according to Figures 1 to 3 is given.

[0032] The enthalpy (ΔH°) and entropy (ΔS°) of an oligonucleotide duplex are needed for the calculation of its melting temperature:

$$T_m = \frac{\Delta H^\circ}{\Delta S^\circ[\mathrm{Na}^+] + R \times \ln(C_T)} - 273.15 \qquad \text{[Eq 1]}$$

where R is the gas constant (1.987 cal/(K·mol)), CT is the strand concentration, and ΔS°[Na+] is the entropy at a given sodium equivalent concentration. For non-self-complementary sequences CT is the concentration of oligonucleotides divided by 4. A sequence is considered to be self-complementary if two strands of the same sequence can form a duplex.
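Eq 1 can be written as a short Python function, given here as a sketch. The function name and parameter names are hypothetical. ΔH° is passed in kcal/mol and converted to cal/mol; for self-complementary sequences the sketch assumes that CT is the undivided oligonucleotide concentration, which is not stated in the text.

```python
import math

R = 1.987  # gas constant in cal/(K*mol)


def melting_temp_c(dh_kcal, ds_na_cal, oligo_molar, self_complementary=False):
    """Eq 1: Tm in deg C from dH (kcal/mol) and salt-corrected dS (cal/(K*mol))."""
    # Assumption: CT is not divided by 4 for self-complementary sequences.
    ct = oligo_molar if self_complementary else oligo_molar / 4
    return dh_kcal * 1000.0 / (ds_na_cal + R * math.log(ct)) - 273.15


# Values of the 18mer example: dH = -141.8 kcal/mol,
# dS[Na] = -387.5 cal/(K*mol), probe concentration 0.1 uM
tm = melting_temp_c(-141.8, -387.5, 0.1e-6)
print(round(tm, 2))
```

With the rounded input values the result lies within about 0.1 °C of the 62.7 °C obtained in the worked example; the small difference comes from rounding ΔS°[Na] before the division.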
[0033] For the calculation of Tm, ΔH° needs to be calculated from Eq 2 and ΔS°[Na] from Eqs 3-5, knowing CT. According to the n-n model, the energy for helix formation is the sum of three terms: one for the helix propagation energy from the sum over every base pair, one for the energy change caused by helix initiation, and one to account for effects caused by duplex formation of self-complementary strands. The following formula applies for the calculation of ΔH° for an oligonucleotide:

$$\Delta H^\circ_{total} = \sum_i n_i \, \Delta H^\circ(i) + \Delta H^\circ(5'\,\mathrm{init}) + \Delta H^\circ(3'\,\mathrm{init}) + \Delta H^\circ(\mathrm{symmetry}) \qquad \text{[Eq 2]}$$
ΔH°(i) is the enthalpy change for each of the ten possible n-n doublets and n_i is the number of occurrences of the respective doublet.

[0034] For ΔS° a similar formula applies:

$$\Delta S^\circ_{total} = \sum_i n_i \, \Delta S^\circ(i) + \Delta S^\circ(5'\,\mathrm{init}) + \Delta S^\circ(3'\,\mathrm{init}) + \Delta S^\circ(\mathrm{symmetry}) \qquad \text{[Eq 3]}$$

[0035] For Eq 2 and Eq 3 it should be noted that the symmetry correction is zero for non-self-complementary oligonucleotides. The ΔH° and ΔS° values for Watson-Crick nearest neighbors with matched and mismatched basepairs as well as those for initiation and symmetry correction were taken from published data and are summarized in tables 1 and 2. A standard PCR buffer contains 50 mM KCl and 1.5 mM Mg2+; for rapid cycling applications higher Mg2+ concentrations are often used. Mg2+ cations have a 140 times higher stabilizing effect on duplexes compared to Na+ [13]. The concentrations of the monovalent ions do not add much to this effect, at least under the conditions which apply for PCR [13]. Therefore the sodium equivalents are calculated as follows:
$$\mathrm{Na}^+\ \text{equivalent} = [\mathrm{Mg}^{2+}] \times 140 \qquad \text{[Eq 4a]}$$

[0036] Alternatively, a formula based on a 100 times higher DNA stabilizing effect of Mg2+ cations may be used. This also takes into account the monovalent ions [14]:

$$\mathrm{Na}^+\ \text{equivalent} = [\mathrm{Mg}^{2+}] \times 100 + [\text{monovalent ions}] \qquad \text{[Eq 4b]}$$
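The two alternatives for the sodium equivalents can be sketched in Python as follows. The function names are hypothetical; all concentrations are assumed to be given in mol/l.

```python
def na_equiv_4a(mg_molar):
    """Eq 4a: sodium equivalents from the Mg2+ concentration alone."""
    return 140.0 * mg_molar


def na_equiv_4b(mg_molar, monovalent_molar):
    """Eq 4b: sodium equivalents from Mg2+ plus the monovalent ions."""
    return 100.0 * mg_molar + monovalent_molar


# 2.5 mM Mg2+ according to Eq 4a
print(na_equiv_4a(2.5e-3))
# Standard PCR buffer (1.5 mM Mg2+, 50 mM KCl) according to Eq 4b
print(na_equiv_4b(1.5e-3, 50e-3))
```

For 2.5 mM Mg2+ Eq 4a gives 0.35 M; for the standard PCR buffer Eq 4b gives 0.15 M + 0.05 M = 0.20 M.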

[0037] The entropy is salt dependent and therefore ΔS° must be corrected if the sodium concentration is different from 1 M NaCl, the condition under which the parameters tabulated in table 1 were derived [5]:

$$\Delta S^\circ[\mathrm{Na}] = \Delta S^\circ[1\,\mathrm{M\ Na}] + 0.368 \times N \times \ln[\mathrm{Na}^+] \qquad \text{[Eq 5]}$$

where N is equal to the oligonucleotide length minus 1.
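Eq 5 can be sketched in Python as follows. The function name is hypothetical; the sodium equivalent concentration is assumed to be given in mol/l and the entropy in cal/(K·mol).

```python
import math


def salt_corrected_entropy(ds_1m, oligo_length, na_equiv_molar):
    """Eq 5: correct dS (cal/(K*mol)) from 1 M NaCl to the given Na+ equivalent."""
    n = oligo_length - 1
    return ds_1m + 0.368 * n * math.log(na_equiv_molar)


# 18mer with dS = -380.9 cal/(K*mol) at 1 M NaCl, corrected to 0.35 M
print(round(salt_corrected_entropy(-380.9, 18, 0.35), 1))
```

At 1 M the logarithm is zero and the entropy is unchanged; below 1 M the correction makes ΔS° more negative and thereby lowers the calculated Tm.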

If DMSO is included in a PCR reaction, it induces a Tm reduction of 0.6°C for each percent DMSO [15]. 

1. Example: Calculation of Tm for an 18mer without mismatches

[0038] As an example, the Tm of the following 18mer according to Figure 4 is calculated:

5'-ACG ATG GCA GTA GCA TGC-3'
3'-TGC TAC CGT CAT CGT ACG-5'

[0039] This example oligonucleotide has 17 n-n doublets. These are AC/TG, CG/GC, GA/CT, AT/TA, TG/AC, GG/CC, GC/CG and so on, see Figure 4.

It should be noted that the first doublet (AC/TG) is identical with GT/CA. Only the thermodynamic data for the latter doublet are listed in table 1 to save space. Palindromic sequences are not listed twice. For the n-n doublets given in table 1, the numbers of occurrences in the above 18mer oligonucleotide are as follows:

N(AA/TT) = 0, N(AT/TA) = 2, N(TA/AT) = 1, N(CA/GT) = 4, N(GT/CA) = 2, N(CT/GA) = 2, N(GA/CT) = 1, N(CG/GC) = 1, N(GC/CG) = 3, N(GG/CC) = 1
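The counting of the doublets can be sketched in Python as follows. The helper names `doublet_key` and `count_doublets` are hypothetical. Each 5'->3' doublet of the upper strand is mapped to the name under which it is listed in table 1; a doublet that is not listed is replaced by its reverse complement, which describes the same duplex read from the other strand.

```python
from collections import Counter

COMP = str.maketrans("ACGT", "TGCA")
TABLE1_KEYS = {"AA/TT", "AT/TA", "TA/AT", "CA/GT", "GT/CA",
               "CT/GA", "GA/CT", "CG/GC", "GC/CG", "GG/CC"}


def doublet_key(d):
    """Table 1 name of the 5'->3' doublet d, e.g. 'AC' -> 'GT/CA'."""
    key = f"{d}/{d.translate(COMP)}"
    if key in TABLE1_KEYS:
        return key
    rc = d.translate(COMP)[::-1]  # same duplex read from the other strand
    return f"{rc}/{rc.translate(COMP)}"


def count_doublets(seq):
    """Number of occurrences of each table 1 doublet in seq (5'->3')."""
    return Counter(doublet_key(seq[i:i + 2]) for i in range(len(seq) - 1))


counts = count_doublets("ACGATGGCAGTAGCATGC")
for name in sorted(TABLE1_KEYS):
    print(name, counts[name])
```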

[0040] With data from table 1, ΔH° can be calculated according to Eq 2:

Σ n_i ΔH°(i) results in 0x(-7.9) + 2x(-7.2) + 1x(-7.2) + 4x(-8.5) + 2x(-8.4) + 2x(-7.8) + 1x(-8.2) + 1x(-10.6) + 3x(-9.8) + 1x(-8.0) = -144.2 kcal/mol

[0041] The initiation term for each direction is 0.1 kcal/mol for initiation with terminal G-C and 2.3 kcal/mol for initiation with terminal A-T.

No symmetry correction is necessary since the oligonucleotide is non-self-complementary.

ΔH°total = -144.2 + 0.1 + 2.3 kcal/mol = -141.8 kcal/mol.

[0042] ΔS° calculation according to Eq 3:

Σ n_i ΔS°(i) results in 0x(-22.2) + 2x(-20.4) + 1x(-21.3) + 4x(-22.7) + 2x(-22.4) + 2x(-21.0) + 1x(-22.2) + 1x(-27.2) + 3x(-24.4) + 1x(-19.9) = -382.2 cal/(K·mol)

The initiation term for each direction is -2.8 cal/(K·mol) for initiation with terminal G-C and 4.1 cal/(K·mol) for initiation with terminal A-T, in total +1.3 cal/(K·mol).

No symmetry correction is necessary since the oligonucleotide is non-self-complementary.

ΔS°total = -380.9 cal/(K·mol)
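The sums of Eq 2 and Eq 3 for this example can be reproduced with the following Python sketch. The names `TABLE1`, `INIT`, `SYMMETRY` and `duplex_totals` are hypothetical; the numerical values are those of table 1, and the doublet counts are those listed in the example.

```python
# Table 1 values as (dH in kcal/mol, dS in cal/(K*mol))
TABLE1 = {
    "AA/TT": (-7.9, -22.2), "AT/TA": (-7.2, -20.4), "TA/AT": (-7.2, -21.3),
    "CA/GT": (-8.5, -22.7), "GT/CA": (-8.4, -22.4), "CT/GA": (-7.8, -21.0),
    "GA/CT": (-8.2, -22.2), "CG/GC": (-10.6, -27.2), "GC/CG": (-9.8, -24.4),
    "GG/CC": (-8.0, -19.9),
}
INIT = {"GC": (0.1, -2.8), "AT": (2.3, 4.1)}  # initiation per terminal pair
SYMMETRY = (0.0, -1.4)                         # self-complementary oligos only


def duplex_totals(counts, terminal_pairs, self_complementary=False):
    """Eq 2 and Eq 3: return (dH_total, dS_total) at 1 M NaCl."""
    dh = sum(n * TABLE1[k][0] for k, n in counts.items())
    ds = sum(n * TABLE1[k][1] for k, n in counts.items())
    for pair in terminal_pairs:
        dh += INIT[pair][0]
        ds += INIT[pair][1]
    if self_complementary:
        dh += SYMMETRY[0]
        ds += SYMMETRY[1]
    return dh, ds


counts = {"AA/TT": 0, "AT/TA": 2, "TA/AT": 1, "CA/GT": 4, "GT/CA": 2,
          "CT/GA": 2, "GA/CT": 1, "CG/GC": 1, "GC/CG": 3, "GG/CC": 1}
dh, ds = duplex_totals(counts, ("AT", "GC"))  # 5' end A-T, 3' end G-C
print(round(dh, 1), round(ds, 1))
```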
[0043] Sodium equivalents calculation according to Eq 4a:

Assuming the PCR contains 2.5 mM Mg2+, according to Eq 4a the sodium equivalent is 2.5 mM x 140 = 350 mM = 0.35 M.

[0044] Calculating ΔS° for a given sodium equivalent according to Eq 5:

For an 18 bp duplex, N = 17.

ΔS°[Na] = -380.9 cal/(K·mol) + 0.368 x 17 x ln(0.35) = -387.5 cal/(K·mol)
[0045] Calculating Tm according to Eq 1:

Because of the different units, ΔH° must be multiplied by 1000 to convert it to cal/mol. To convert from Kelvin to Celsius, 273.15 is subtracted.

The concentration of the probe is 0.1 µM = 0.1 x 10^-6 M, and this must be divided by 4 because the sequence is non-self-complementary.

$$T_m = \frac{-141800\ \mathrm{cal/mol}}{-387.5\ \mathrm{cal/(K \cdot mol)} + 1.987\ \mathrm{cal/(K \cdot mol)} \times \ln\left(\frac{0.1 \times 10^{-6}}{4}\right)} - 273.15 = 62.7\,^\circ\mathrm{C}$$



2. Example: Calculation of Tm for an 18mer with mismatches



[0046] As an example, the same 18mer 5'-ACG ATG GCA GTA GCA TGC-3' as above is taken, assuming a G->A mutation at the third base. Hybridization of the wild-type probe with the mutated strand leads to the following duplex with a G-T mismatch in the trimer CGA/GTT:

5'-ACG ATG GCA GTA GCA TGC-3'
3'-TGT TAC CGT CAT CGT ACG-5'



[0047] The example oligonucleotide has 15 Watson-Crick paired doublets (table 1) and 2 mismatched doublets (table 2). These are AC/TG, CG/GT, GA/TT, AT/TA, TG/AC, GG/CC, GC/CG and so on. The numbers of the different Watson-Crick doublets are:

N(AA/TT) = 0, N(AT/TA) = 2, N(TA/AT) = 1, N(CA/GT) = 4, N(GT/CA) = 2, N(CT/GA) = 2, N(GA/CT) = 0, N(CG/GC) = 0, N(GC/CG) = 3, N(GG/CC) = 1

The two doublets affected by the mutation are CG/GT and GA/TT. Each of these doublets occurs once. The remaining n-n pairs are the same as in example 1.

[0048] With data also from table 2, ΔH° can be calculated according to Eq 2:

For the Watson-Crick paired bases, Σ n_i ΔH°(i) results in 0x(-7.9) + 2x(-7.2) + 1x(-7.2) + 4x(-8.5) + 2x(-8.4) + 2x(-7.8) + 0x(-8.2) + 0x(-10.6) + 3x(-9.8) + 1x(-8.0) = -125.4 kcal/mol. Two doublets are affected by mismatches and their thermodynamic properties are found in table 2. For the CG/GT doublet ΔH° = -4.1 kcal/mol, for GA/TT (look up TT/GA in table 2, this is the same as GA/TT) ΔH° = -1.3 kcal/mol.

Furthermore, the initiation term is added for each direction, that is 0.1 kcal/mol for initiation with terminal G-C and 2.3 kcal/mol for initiation with terminal A-T.

No symmetry correction is necessary as the oligonucleotide is non-self-complementary.

ΔH°total = -125.4 - 4.1 - 1.3 + 0.1 + 2.3 = -128.4 kcal/mol

[0049] ΔS° calculation according to Eq 3:

For the Watson-Crick paired bases, Σ n_i ΔS°(i) results in 0x(-22.2) + 2x(-20.4) + 1x(-21.3) + 4x(-22.7) + 2x(-22.4) + 2x(-21.0) + 0x(-22.2) + 0x(-27.2) + 3x(-24.4) + 1x(-19.9) = -332.8 cal/(K·mol). Two doublets are affected by mismatches and their thermodynamic properties are again found in table 2. For the CG/GT doublet ΔS° = -11.7 cal/(K·mol), for GA/TT (look up TT/GA in table 2, this is the same as GA/TT) ΔS° = -5.3 cal/(K·mol).

The initiation term is added for each direction, that is -2.8 cal/(K·mol) for initiation with terminal G-C and 4.1 cal/(K·mol) for initiation with terminal A-T.

No symmetry correction is necessary as the oligonucleotide is non-self-complementary.

ΔS°total = -332.8 - 11.7 - 5.3 - 2.8 + 4.1 = -348.5 cal/(K·mol)
[0050] Sodium equivalents calculation according to Eq 4a:

[0051] Conditions are the same as above: with 2.5 mM Mg2+, according to Eq 4a the sodium equivalent is 2.5 mM x 140 = 350 mM = 0.35 M.

[0052] Calculating ΔS° for a given sodium equivalent according to Eq 5:

For an 18 bp duplex, N = 17.

ΔS°[Na] = -348.5 cal/(K·mol) + 0.368 x 17 x ln(0.35) = -355.1 cal/(K·mol)
[0053] Calculating Tm according to Eq 1:

Again, ΔH° must be multiplied by 1000 to convert it to cal/mol, and 273.15 is subtracted to convert from Kelvin to Celsius. The concentration of the probe is 0.1 µM = 0.1 x 10^-6 M, and this must be divided by 4 because the sequence is non-self-complementary.

$$T_m = \frac{-128400\ \mathrm{cal/mol}}{-355.1\ \mathrm{cal/(K \cdot mol)} + 1.987\ \mathrm{cal/(K \cdot mol)} \times \ln\left(\frac{0.1 \times 10^{-6}}{4}\right)} - 273.15 = 56.2\,^\circ\mathrm{C}$$

3. Example: Calculation of ΔTm for a hybridization probe mutation detection assay

[0054] In the first example a Tm of 62.7°C was calculated for an 18mer probe. In the second calculation a lower Tm of 56.2°C was calculated due to a single mismatch caused by a G->A mutation. If the former probe were to hybridize to the latter mutated strand, a calculated reduction in Tm of 6.5°C would result.
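The three examples can be combined into one Python sketch that calculates both melting points and their difference. The names `NN`, `MISMATCH`, `INIT` and `duplex_tm` are hypothetical. The Watson-Crick values are those of table 1; the two mismatch values are the ones quoted in example 2 and are keyed in the orientation of that example. The sketch assumes non-self-complementary strands, Watson-Crick pairs at both ends, and the conditions of the examples (0.1 µM probe, 0.35 M sodium equivalents).

```python
import math

COMP = str.maketrans("ACGT", "TGCA")
# (dH kcal/mol, dS cal/(K*mol)) keyed by the 5'->3' doublet of one strand
NN = {"AA": (-7.9, -22.2), "AT": (-7.2, -20.4), "TA": (-7.2, -21.3),
      "CA": (-8.5, -22.7), "GT": (-8.4, -22.4), "CT": (-7.8, -21.0),
      "GA": (-8.2, -22.2), "CG": (-10.6, -27.2), "GC": (-9.8, -24.4),
      "GG": (-8.0, -19.9)}
# Mismatched doublets as quoted in example 2: (top 5'->3', bottom 3'->5')
MISMATCH = {("CG", "GT"): (-4.1, -11.7), ("GA", "TT"): (-1.3, -5.3)}
INIT = {"G": (0.1, -2.8), "C": (0.1, -2.8), "A": (2.3, 4.1), "T": (2.3, 4.1)}


def duplex_tm(top, bottom_3to5, probe_molar=0.1e-6, na_equiv=0.35):
    """Tm in deg C of top (5'->3') paired with bottom written 3'->5'."""
    dh = ds = 0.0
    for i in range(len(top) - 1):
        t, b = top[i:i + 2], bottom_3to5[i:i + 2]
        if b == t.translate(COMP):  # Watson-Crick doublet
            h, s = NN[t] if t in NN else NN[t.translate(COMP)[::-1]]
        else:                       # doublet containing the mismatch
            h, s = MISMATCH[(t, b)]
        dh, ds = dh + h, ds + s
    for end in (top[0], top[-1]):   # initiation at both termini
        dh, ds = dh + INIT[end][0], ds + INIT[end][1]
    ds += 0.368 * (len(top) - 1) * math.log(na_equiv)              # Eq 5
    return dh * 1000 / (ds + 1.987 * math.log(probe_molar / 4)) - 273.15


probe = "ACGATGGCAGTAGCATGC"
tm_match = duplex_tm(probe, "TGCTACCGTCATCGTACG")
tm_mismatch = duplex_tm(probe, "TGTTACCGTCATCGTACG")
print(round(tm_match, 1), round(tm_mismatch, 1), round(tm_match - tm_mismatch, 1))
```

A complete implementation would need the full content of table 2 and a general lookup of equivalent doublet orientations; the sketch only covers the two mismatched doublets of the example.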

[0055] The thermodynamic calculations assume that the annealing occurs at pH 7.0. This is not the case in standard PCR buffer systems, but the resulting error seems to be small. More importantly, mismatches in the terminal and penultimate positions are not properly reflected by the parameters in table 2 [3;10]. The thermodynamic data set for double mismatches (i.e., immediately adjacent mismatches) is still incomplete. A reliable prediction is possible only for the few double mismatches listed in table 2. The influence of the salt concentration on oligonucleotide stability is also still under investigation [4;5;13]. Nevertheless, the methods presented here will allow a reliable estimate of hybridization probe Tm on the LightCycler but may require correction of the sodium equivalents according to the local PCR conditions.



4. Validation of the n-n model on the LightCycler



[0056] The formulas described above were incorporated into a Microsoft Excel spreadsheet which is capable of calculating the thermodynamic data for matched and mismatched oligonucleotides under different concentrations and ionic strengths. This program is available for download via http://serverl.medikc.med.uni-goettingen.de/meltcalc.htm or it may be requested by email. A total of 87 melting points resulting from eleven different published [16-26] and 14 unpublished hybridization probe assays were calculated and showed an excellent fit with the predictions from the n-n model, as may be seen from Figure 5. For the data shown in Figure 5, Eq 4b was used to calculate the Na+ equivalents, which gave a better fit when using the published Tm data from different laboratories. However, under the standard PCR conditions used, with 2.5 mM Mg2+ and 5% DMSO, the best predictions were obtained using Eq 4a.

[0057] Genotyping assays for the apolipoprotein B-3500 and factor V Leiden mutations have been performed. For both loci, different mutations have been described that occur under the detection probe. These are the G10699A and C10698T mutations of apolipoprotein B [27] and the G1691A, A1692C and G1689A mutations of factor V [20;28]. Increasing knowledge of the prevalence of single nucleotide polymorphisms in the human genome [29;30] raises the important question of the sensitivity of hybridization probe assays towards different mutations under a probe. Melting point calculations for different possible mutations under a 19mer probe led us to speculate that no mutation under a properly designed probe will remain undetected. However, different mutations are possible that destabilize the probe binding to the same extent [12]. These mutations will not be discriminated from each other. Figure 6 shows how 19mer probes with different GC content are destabilized by a single base mutation. The most important trend is that probes with higher GC content show less Tm shift due to a single mutation. Another principal point is that shorter probes show higher Tm shifts because the destabilizing impact of the mutation on the total stability is higher in shorter duplexes. The thermodynamic n-n model provides the theoretical basis for the stability of matched and mismatched oligonucleotides. The comparison of experimentally derived melting points with those calculated from the n-n model shows very good agreement.

Using a detection probe that has been properly optimized according to this model, it may be appreciated that the detection of every point mutation should be possible. However, it is possible that different mutations under a probe result in the same Tm reduction. Important factors for the design of hybridization probes are: (I.) Absence of significant self- or cross-dimerization of probes and amplification primers. (II.) Equal melting points of detection and anchor probe and absence of false priming sites on the amplicon for hybridization probes used for quantification. (III.) Use of a detection probe with a maximum Tm shift caused by the investigated mutation (see below) and an anchor probe with a Tm approximately 10% higher than that of the detection probe for mutation detection. Provided it is not in the penultimate or ultimate position, the place where the mutation is located under the probe is not important. The resulting melting point shift will be affected by the type of mutation and the neighboring bases but not significantly by the position itself. The commonest mutations are G->A mutations [31]. However, the resulting G:T mismatch under the probe is, together with G:A and G:G mismatches, among the most stable mismatches that occur [9]. It is therefore advisable to place the detection probe on the antisense strand. The resulting complementary C:A mismatch has a higher destabilizing effect and is more readily detected. (IV.) The gap between the fluorophores should be as small as possible since the FRET efficiency is inversely proportional to the sixth power of the distance between the donor and the acceptor dye [32]. A gap of one to five basepairs has been reported to give good results [1].
Table 1.

Unified parameters for oligonucleotide ΔH° and ΔS° in 1 M NaCl [5].
(Nearest neighbor dimer duplexes are presented with a slash separating strands in antiparallel orientation, e.g. AT/TA
means 5'-AT-3' Watson-Crick paired with 3'-TA-5'.)

Sequence                                             ΔH° (kcal/mol)    ΔS° (cal/(K·mol))

AA/TT                                                     -7.9              -22.2
AT/TA                                                     -7.2              -20.4
TA/AT                                                     -7.2              -21.3
CA/GT                                                     -8.5              -22.7
GT/CA                                                     -8.4              -22.4
CT/GA                                                     -7.8              -21.0
GA/CT                                                     -8.2              -22.2
CG/GC                                                    -10.6              -27.2
GC/CG                                                     -9.8              -24.4
GG/CC                                                     -8.0              -19.9
Initiation with terminal G-C                               0.1               -2.8
Initiation with terminal A-T                               2.3                4.1
Symmetry correction for self-complementary oligos          0                 -1.4
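As a minimal sketch of how the Table 1 values can be combined, the following Python code sums the nearest-neighbor terms for a perfectly matched duplex and converts them into a melting temperature. Several points are assumptions rather than statements from this document: the two-state relation Tm = ΔH° / (ΔS° + R ln(C/x)) with x = 4 for non-self-complementary strands, the application of one initiation term per duplex end, and the default total strand concentration. The function names are hypothetical.

```python
import math

R = 1.987  # gas constant in cal/(K*mol)

# Table 1 values as (dH in kcal/mol, dS in cal/(K*mol)), keyed by the
# 5'->3' doublet of one strand; both readings of each duplex are listed.
NN = {
    "AA": (-7.9, -22.2), "TT": (-7.9, -22.2),
    "AT": (-7.2, -20.4), "TA": (-7.2, -21.3),
    "CA": (-8.5, -22.7), "TG": (-8.5, -22.7),
    "GT": (-8.4, -22.4), "AC": (-8.4, -22.4),
    "CT": (-7.8, -21.0), "AG": (-7.8, -21.0),
    "GA": (-8.2, -22.2), "TC": (-8.2, -22.2),
    "CG": (-10.6, -27.2), "GC": (-9.8, -24.4),
    "GG": (-8.0, -19.9), "CC": (-8.0, -19.9),
}
INIT_GC = (0.1, -2.8)  # initiation with terminal G-C
INIT_AT = (2.3, 4.1)   # initiation with terminal A-T
SYMMETRY_DS = -1.4     # entropy correction, self-complementary oligos only


def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def duplex_thermo(seq):
    """Return (dH, dS) for seq paired with its perfect complement."""
    dh = ds = 0.0
    for i in range(len(seq) - 1):
        h, s = NN[seq[i:i + 2]]
        dh += h
        ds += s
    for end in (seq[0], seq[-1]):  # assumption: one initiation term per end
        h, s = INIT_GC if end in "GC" else INIT_AT
        dh += h
        ds += s
    if seq == reverse_complement(seq):
        ds += SYMMETRY_DS
    return dh, ds


def melting_temp(seq, total_strand_conc=2e-7):
    """Two-state Tm in degrees Celsius at 1 M NaCl (assumed model)."""
    dh, ds = duplex_thermo(seq)
    x = 1.0 if seq == reverse_complement(seq) else 4.0
    return 1000.0 * dh / (ds + R * math.log(total_strand_conc / x)) - 273.15
```

For example, `duplex_thermo("CGTTGA")` adds five doublet terms and two initiation terms; the concentration passed to `melting_temp` is a free parameter of the sketch and should be replaced by the assay's actual value.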



Table 2.

Parameters for mismatched oligonucleotide ΔH° and ΔS° in 1 M NaCl.
(References are given in the table. Nearest neighbor dimer duplexes are presented with a slash separating strands
in antiparallel orientation, e.g. AT/TG means 5'-AT-3' mismatched with 3'-TG-5'. Mismatches are in the second
position of the listed doublet except for double mismatches. These parameters are not valid for the calculation of
mismatches in the terminal or penultimate position.)

Sequence                          ΔH° (kcal/mol)    ΔS° (cal/(K·mol))

Internal G-A mismatches [9]
AA/TG                                  -0.6               -2.3
AG/TA                                  -0.7               -2.3
CA/GG                                  -0.7               -2.3
CG/GA                                  -4.0              -13.2
GA/CG                                  -0.6               -1.0
GG/CA                                   0.5                3.2
TA/AG                                   0.7                0.7
TG/AA                                   3.0                7.4

Internal C-T mismatches [8]
AC/TT                                   0.7                0.2
AT/TC                                  -1.2               -6.2
CC/GT                                  -0.8               -4.5
CT/GC                                  -1.5               -6.1
GC/CT                                   2.3                5.4
GT/CC                                   5.2               13.5
TC/AT                                   1.2                0.7
TT/AC                                   1.0                0.7

Internal A-C mismatches [7]
AA/TC                                   2.3                4.6
AC/TA                                   5.3               14.6
CA/GC                                   1.9                3.7
CC/GA                                   0.6               -0.6
GA/CC                                   5.2               14.2
GC/CA                                  -0.7               -3.8
TA/AC                                   3.4                8.0
TC/AA                                   7.6               20.2

Internal G-T mismatches [3]
AG/TT                                   1.0                0.9
AT/TG                                  -2.5               -8.3
CG/GT                                  -4.1              -11.7
CT/GG                                  -2.8               -8.0
GG/CT                                   3.3               10.4
GT/CG                                  -4.4              -12.3
TG/AT                                  -0.1               -1.7
TT/AG                                  -1.3               -5.3

Internal A-A mismatches [10]
AA/TA                                   1.2                1.7
CA/GA                                  -0.9               -4.2
GA/CA                                  -2.9               -9.8
TA/AA                                   4.7               12.9

Internal C-C mismatches [10]
AC/TC                                   0.0               -4.4
CC/GC                                  -1.5               -7.2
GC/CC                                   3.6                8.9
TC/AC                                   6.1               16.4

Internal G-G mismatches [10]
AG/TG                                  -3.1               -9.5
CG/GG                                  -4.9              -15.3
GG/CG                                  -6.0              -15.8
TG/AG                                   1.6                3.6

Internal T-T mismatches [10]
AT/TT                                  -2.7              -10.8
CT/GT                                  -5.0              -15.8
GT/CT                                  -2.2               -8.4
TT/AT                                   0.2               -1.5

Double mismatches [3]
GG/TT                                   5.8               16.3
GT/TG                                   4.1                9.5
TG/GT                                  -1.4               -6.2
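The following Python sketch illustrates how the matched parameters of Table 1 and the mismatch parameters of Table 2 might be combined to estimate the Tm shift caused by a single internal mismatch. It carries only the G-T and A-C groups of Table 2, ignores the symmetry correction, and assumes the two-state relation Tm = ΔH° / (ΔS° + R ln(C/4)) and a total strand concentration of 0.2 µM; the example sequences and all function names are hypothetical. A doublet whose mismatch sits in the first position is looked up by reading the same doublet from the other strand, so that the mismatch falls in the second position as Table 2 requires.

```python
import math

R = 1.987  # gas constant in cal/(K*mol)

# Keys read "top strand 5'->3' / bottom strand 3'->5'";
# values are (dH in kcal/mol, dS in cal/(K*mol)).
MATCHED = {  # Table 1
    "AA/TT": (-7.9, -22.2), "AT/TA": (-7.2, -20.4), "TA/AT": (-7.2, -21.3),
    "CA/GT": (-8.5, -22.7), "GT/CA": (-8.4, -22.4), "CT/GA": (-7.8, -21.0),
    "GA/CT": (-8.2, -22.2), "CG/GC": (-10.6, -27.2), "GC/CG": (-9.8, -24.4),
    "GG/CC": (-8.0, -19.9),
}
MISMATCHED = {  # subset of Table 2: internal G-T and A-C mismatches only
    "AG/TT": (1.0, 0.9), "AT/TG": (-2.5, -8.3), "CG/GT": (-4.1, -11.7),
    "CT/GG": (-2.8, -8.0), "GG/CT": (3.3, 10.4), "GT/CG": (-4.4, -12.3),
    "TG/AT": (-0.1, -1.7), "TT/AG": (-1.3, -5.3),
    "AA/TC": (2.3, 4.6), "AC/TA": (5.3, 14.6), "CA/GC": (1.9, 3.7),
    "CC/GA": (0.6, -0.6), "GA/CC": (5.2, 14.2), "GC/CA": (-0.7, -3.8),
    "TA/AC": (3.4, 8.0), "TC/AA": (7.6, 20.2),
}
PARAMS = {**MATCHED, **MISMATCHED}
INIT = {"G": (0.1, -2.8), "C": (0.1, -2.8), "A": (2.3, 4.1), "T": (2.3, 4.1)}
COMP = str.maketrans("ACGT", "TGCA")


def doublet(top, bottom, i):
    """Parameters of the doublet at positions i, i+1 of the aligned strands."""
    key = f"{top[i:i + 2]}/{bottom[i:i + 2]}"
    if key in PARAMS:
        return PARAMS[key]
    # Same doublet read 5'->3' along the bottom strand.
    flipped = f"{bottom[i + 1]}{bottom[i]}/{top[i + 1]}{top[i]}"
    return PARAMS[flipped]  # KeyError if the subset above does not cover it


def melting_temp(top, bottom, total_strand_conc=2e-7):
    """Two-state Tm in degrees Celsius at 1 M NaCl (assumed model)."""
    dh = ds = 0.0
    for i in range(len(top) - 1):
        h, s = doublet(top, bottom, i)
        dh += h
        ds += s
    for end in (top[0], top[-1]):  # assumption: one initiation term per end
        h, s = INIT[end]
        dh += h
        ds += s
    return 1000.0 * dh / (ds + R * math.log(total_strand_conc / 4.0)) - 273.15


def tm_shift(probe, index, target_base):
    """Tm(matched) - Tm(mismatched) when the target base opposite probe[index] changes."""
    if index < 2 or index > len(probe) - 3:
        raise ValueError("Table 2 is not valid for terminal or penultimate mismatches")
    matched = probe.translate(COMP)  # target written 3'->5', aligned to the probe
    mutated = matched[:index] + target_base + matched[index + 1:]
    return melting_temp(probe, matched) - melting_temp(probe, mutated)


# Hypothetical 11mer around a G->A mutation, probed on either strand.
shift_gt = tm_shift("GCATCGTACCA", 5, "T")  # probe G over target T
shift_ca = tm_shift("TGGTACGATGC", 5, "A")  # probe C over target A
print(f"G:T shift {shift_gt:.1f} K, C:A shift {shift_ca:.1f} K")
```

Under these assumptions the C:A mismatch gives the larger shift (about 24 K against about 13 K for G:T), which is in line with the recommendation to choose the strand on which the less stable mismatch forms.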






Claims 

1. A method of automatically selecting oligonucleotide hybridization probes for detecting a mutation causing a
mismatch in a DNA duplex, the method comprising the steps of:

defining the mutation to be detected by 

a DNA sequence of a mutant DNA strand and 

a DNA sequence of a wild-type DNA strand corresponding to the mutant DNA strand; 

defining an integer variable L1 for a first length of the oligonucleotide hybridization probes from the place of 
mutation in a first direction; 

defining a range from m1 to n1 of possible values of the variable L1;

defining an integer variable L2 for a second length of the oligonucleotide hybridization probes from the place 
of mutation in a second direction opposite to the first direction; 
defining a range from m2 to n2 of possible values of the variable L2; 
selecting one or more kind(s) of the oligonucleotide hybridization probes from

probes which have a one-to-one hybridization fit to the mutant DNA strand,

probes which have a one-to-one hybridization fit to the wild-type DNA strand corresponding to the mutant
DNA strand,

probes which have a one-to-one hybridization fit to an opposite sense mutant DNA strand which is the
complementary DNA strand to the mutant DNA strand, and

probes which have a one-to-one hybridization fit to an opposite sense wild-type DNA strand which is the
complementary DNA strand to the wild-type DNA strand corresponding to the mutant DNA strand;

determining all possible hybridization probes from the possible values of the variables L1 and L2, and the
kind(s) of probes;

defining a thermodynamic nearest-neighbor model for calculating a melting point of a certain hybridization 
probe hybridized to a DNA strand, the melting point being a temperature at which a predetermined percentage 
of a multitude of identical pairs of said certain hybridization probe and said DNA strand is in an annealed state; 
calculating, on basis of the thermodynamic nearest-neighbor model, for all possible hybridization probes 

a first melting point of the respective probe hybridized with its complementary mutant DNA strand or
opposite sense mutant DNA strand, respectively,

a second melting point of the respective probe hybridized with its complementary wild-type DNA strand
or opposite sense wild-type DNA strand, respectively, and

a temperature difference between the second melting point and the first melting point; and 

ranking the possible hybridization probes with regard to the temperature difference. 
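The enumeration and ranking steps of claim 1 can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed implementation: L1 is taken to count bases on the 5' side of the mutated position and L2 on the 3' side of the given strand, the kind labels are hypothetical names, and the melting point calculation is passed in as a callable `delta_tm` rather than implemented here.

```python
from itertools import product

COMP = str.maketrans("ACGT", "TGCA")
KINDS = ("mutant", "wild_type", "opposite_mutant", "opposite_wild_type")


def reverse_complement(seq):
    return seq.translate(COMP)[::-1]


def enumerate_probes(wild, mutant, pos, l1_range, l2_range, kinds=KINDS):
    """List all candidate probes.

    wild, mutant: same-sense strands written 5'->3' that differ at index pos.
    l1_range, l2_range: inclusive (m, n) bounds for L1 and L2.
    """
    probes = []
    l1_values = range(l1_range[0], l1_range[1] + 1)
    l2_values = range(l2_range[0], l2_range[1] + 1)
    for l1, l2 in product(l1_values, l2_values):
        lo, hi = pos - l1, pos + l2 + 1
        if lo < 0 or hi > len(wild):
            continue  # probe would run off the known sequence
        for kind in kinds:
            strand = mutant if "mutant" in kind else wild
            segment = strand[lo:hi]
            # A probe that fits a strand is the reverse complement of the
            # covered segment; a probe that fits the opposite sense strand
            # has the same sequence as the segment.
            seq = segment if kind.startswith("opposite") else reverse_complement(segment)
            probes.append({"kind": kind, "L1": l1, "L2": l2, "seq": seq})
    return probes


def rank_probes(probes, delta_tm):
    """Sort probes by a caller-supplied temperature difference, largest first."""
    return sorted(probes, key=delta_tm, reverse=True)
```

With ranges of 2 to 3 for L1 and 2 to 4 for L2 and all four kinds, the sketch yields 2 x 3 x 4 = 24 candidates, provided every probe stays within the supplied sequence.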

2. The method of claim 1, and further comprising the steps of:

defining a temperature range of a hybridization assay to be employed for detecting the mutation using the 
oligonucleotide hybridization probes; and 

checking for each of the possible hybridization probes whether both the first and the second melting points
are within the temperature range of the hybridization assay; wherein the step of ranking the possible
hybridization probes with regard to the temperature difference includes:

ranking only those possible hybridization probes which have both their melting points within the temperature 
range of the hybridization assay. 

3. The method of claim 2, wherein the step of defining the temperature range of the hybridization assay to be employed
for detecting the mutation includes: 

defining the temperature range within a range from 20 to 80 °C. 
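A minimal Python sketch of the filtering described in claims 2 and 3 is given below. The `Candidate` record and its field names are hypothetical, the melting points are assumed to have been calculated beforehand, and ranking by the absolute value of the temperature difference is an assumption about how the ranking is meant to be applied.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    seq: str
    tm_mutant: float     # first melting point, degrees Celsius
    tm_wild_type: float  # second melting point, degrees Celsius


def rank_within_window(candidates, window=(20.0, 80.0)):
    """Keep probes whose two melting points both lie inside the assay window,
    then sort them by the size of the temperature difference, largest first."""
    low, high = window
    kept = [
        c for c in candidates
        if low <= c.tm_mutant <= high and low <= c.tm_wild_type <= high
    ]
    return sorted(kept, key=lambda c: abs(c.tm_wild_type - c.tm_mutant), reverse=True)
```

The default window of 20 to 80 °C mirrors the outer limits named in claim 3; a narrower window can be passed for a specific assay.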

4. The method according to any of the claims 1 to 3, and further comprising the step of:

defining at least one physico-chemical parameter of a hybridization assay to be employed for detecting the
mutation using the oligonucleotide hybridization probes; wherein the step of defining a thermodynamic
nearest-neighbor model includes:

defining the thermodynamic nearest-neighbor model dependently of the at least one physico-chemical
parameter of the hybridization assay.

5. The method of claim 4, wherein the step of defining at least one physico-chemical parameter of the hybridization
assay to be employed for detecting the mutation includes: 

defining at least one ion concentration as one of the physico-chemical parameters. 

6. The method of claim 5, wherein the step of defining at least one ion concentration as one of the physico-chemical
parameters includes: 

defining a Na+ cation equivalent concentration. 
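One way such an ion dependence might enter the model is an entropy correction of the form ΔS°(Na+) = ΔS°(1 M) + 0.368 x N x ln[Na+], with N taken as the oligonucleotide length minus one, which is a commonly cited correction for the unified nearest-neighbor parameters. Whether this document uses that particular correction is not stated in this section, so the Python sketch below should be read as an assumption; the function names are hypothetical and ΔH° is treated as independent of salt.

```python
import math

R = 1.987  # gas constant in cal/(K*mol)


def salt_corrected_entropy(ds_1m, length, na_equivalent_molar):
    """Adjust a 1 M NaCl entropy (cal/(K*mol)) to another Na+ equivalent
    concentration, using the assumed 0.368 * N * ln[Na+] correction."""
    n = length - 1  # assumption: N = number of phosphates per strand
    return ds_1m + 0.368 * n * math.log(na_equivalent_molar)


def melting_temp(dh, ds_1m, length, na_equivalent_molar=1.0, total_strand_conc=2e-7):
    """Two-state Tm in degrees Celsius for a non-self-complementary duplex.

    dh in kcal/mol and ds_1m in cal/(K*mol), both valid for 1 M NaCl.
    """
    ds = salt_corrected_entropy(ds_1m, length, na_equivalent_molar)
    return 1000.0 * dh / (ds + R * math.log(total_strand_conc / 4.0)) - 273.15
```

At 1 M the logarithm vanishes and the parameters are used unchanged; lowering the Na+ equivalent concentration makes the entropy more negative and therefore lowers the calculated Tm.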

7. The method according to any of the claims 1 to 6, and further comprising the steps of:

defining an integer number N of the hybridization probes to be automatically selected; and 
making a list of the N top ranked possible hybridization probes. 

8. The method according to one of the claims 1 to 7, wherein the step of ranking the possible hybridization probes
with regard to the temperature difference includes: 

ranking the possible hybridization probes separately for each of the kinds of the oligonucleotide hybridization 
probes. 
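The list of the N top ranked probes (claim 7) and the separate ranking per kind of probe (claim 8) can be combined in a few lines of Python. The tuple layout `(kind, seq, delta_tm)` and the function name are hypothetical, and the temperature differences are assumed to have been calculated beforehand.

```python
import heapq
from collections import defaultdict


def top_n_per_kind(probes, n):
    """Return, for each kind of probe, its n entries with the largest
    temperature difference, in descending order.

    probes: iterable of (kind, seq, delta_tm) tuples.
    """
    by_kind = defaultdict(list)
    for kind, seq, delta_tm in probes:
        by_kind[kind].append((seq, delta_tm))
    return {
        kind: heapq.nlargest(n, entries, key=lambda entry: entry[1])
        for kind, entries in by_kind.items()
    }
```

Using `heapq.nlargest` avoids sorting the full candidate list when N is small compared with the number of possible probes.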

9. The method according to one of the claims 1 to 8, and further comprising the step of:

checking, for each of the possible hybridization probes, whether it is able to hybridize either to the mutant DNA 
strand and the opposite sense mutant DNA strand, or to the wild-type DNA strand and the opposite sense 
wild-type DNA strand in more than one way; 

wherein the step of ranking the possible hybridization probes with regard to the temperature difference includes: 

ranking only those possible hybridization probes which are neither able to hybridize to the mutant DNA strand
and the opposite sense mutant DNA strand, nor to the wild-type DNA strand and the opposite sense wild-type
DNA strand in more than one way.
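A deliberately simplified Python sketch of the check in claim 9 is shown below. It counts only perfect full-length binding sites on a strand and on its opposite sense strand; a practical check would presumably also consider partial or mismatched binding, which is outside this sketch. The function names are hypothetical.

```python
COMP = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMP)[::-1]


def count_sites(probe, strand):
    """Count perfect-match binding sites of probe on strand (overlaps included)."""
    site = reverse_complement(probe)
    count = 0
    start = strand.find(site)
    while start != -1:
        count += 1
        start = strand.find(site, start + 1)
    return count


def binds_in_one_way_only(probe, strand):
    """True if the probe has at most one perfect binding site on the strand
    and its opposite sense strand taken together."""
    opposite = reverse_complement(strand)
    return count_sites(probe, strand) + count_sites(probe, opposite) <= 1
```

To follow the claim, the check would be applied once with the mutant strand and once with the wild-type strand, and only probes passing both would be ranked.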

10. The method according to one of the claims 1 to 9, and further comprising the steps of: 

defining at least one further mutation which may occur on the mutant DNA strand or the wild-type DNA strand
or the opposite sense wild-type DNA strand;

checking, for each of the possible hybridization probes, whether it is sensitive to the further mutation in the 
same way as to the mutation; 

wherein the step of ranking the possible hybridization probes with regard to the temperature difference includes: 

ranking only those possible hybridization probes which are not sensitive to the further mutation in the same 
way as to the mutation. 